Method for cloning of a gene for pol I type DNA polymerase

ABSTRACT

The present invention is directed to a method for cloning a gene for Pol I type DNA polymerase comprising:
     (a) amplifying target DNA with PCR using primers specific to said gene;
     (b) cloning a gene for Pol I type DNA polymerase with a probe selected from the amplified DNA. This invention is also directed to a novel isolated gene coding for Pol I type DNA polymerase cloned in a plasmid.

This is a divisional application of Ser. No. 08/208,036, filed Mar. 9, 1994 now U.S. Pat. No. 5,436,326; which is a continuation of now abandoned application Ser. No. 07/887,282, filed May 22, 1992.

FIELD OF INDUSTRIAL USE

This invention relates to a method for cloning the genes for Pol I type DNA polymerases, which are very useful for genetic engineering research, and also relates to genes that code for new Pol I type DNA polymerases.

STATE OF THE PRIOR ART

On the basis of segmental similarities in the amino acid sequences, DNA polymerases have been classified into two major groups: the Escherichia coli DNA polymerase I family (Pol I type) and the eukaryotic DNA polymerase α family (α type).

DNA polymerases, which are widely used as a reagent in genetic engineering research, currently include DNA polymerase I from Escherichia coli; Klenow fragment, which is a modified form; DNA polymerase from T4 phages; DNA polymerase from T7 phages; heat-stable DNA polymerase from Thermus aquaticus (Taq polymerase); etc. Most of those belong to the Pol I family. These enzymes are used to label specific DNA or in the identification of DNA base sequences, according to their enzymatic properties.

PROBLEM TO BE SOLVED BY THE INVENTION

In general, methods for the use of DNA polymerase differ depending on the specificity of the enzyme, which varies for enzymes of different origins. For example, Bacillus caldotenax has an optimum temperature for growth of about 70° C., so the Pol I type DNA polymerase from this bacterium is probably stable at high temperature, and it should be useful as a reagent for use in genetic engineering research.

However, details about the properties of this DNA polymerase are not known, and a method for its preparation is not available. The structure of the gene that codes for this DNA polymerase and the amino acid sequence of the enzyme are not known, and a method has not been established by which the gene can be isolated, ligated with a vector, and expressed by genetic engineering.

The purpose of this invention is to provide a simple and efficient method for cloning genes that code for novel Pol I type DNA polymerases and to provide the sequence of the gene that codes for a novel Pol I type DNA polymerase.

STEPS TAKEN TO SOLVE THE PROBLEM

To summarize this invention, first, this invention relates to a method for cloning a gene coding for Pol I type DNA polymerase which comprises the following steps:

(a) amplifying target DNA by the polymerase chain reaction (PCR) using a primer represented by SEQ ID No. 1 or SEQ ID No. 2;

(b) cloning a gene for Pol I type DNA polymerase by a conventional method with a probe selected from the amplified DNA of step (a).

Secondly, this invention relates to an isolated gene coding for a Pol I type DNA polymerase, wherein the isolated gene is obtainable from the plasmid pUI101 or pUI205.

Third, this invention relates to an isolated gene coding for a Pol I type DNA polymerase characterized by the fact that said gene can hybridize with the gene of the second invention under stringent conditions.

The inventors achieved this invention, a method by which the gene for a Pol I type DNA polymerase can be cloned easily and efficiently even if the structure of the desired gene and the amino acid sequence of its product are unknown. They designed a pair of primers for use in the amplification of DNA polymerase genes by the PCR. With said pair of primers, it is possible to amplify a portion of an unknown DNA polymerase gene, and the desired gene for Pol I type DNA polymerase can be cloned with the amplified DNA as a probe. Once the gene is cloned, it is possible to obtain Pol I type DNA polymerase at high yield by the methods of genetic engineering.

Below, this invention is described in detail.

As the method for the selection of the desired DNA fragments, first, the published amino acid sequences of known Pol I type DNA polymerases are compared with each other, and on the basis of common amino acid sequences that are found, oligodeoxyribonucleotides are synthesized. The amino acid sequences of Pol I type DNA polymerases can be found in, for example, Journal of Biological Chemistry, vol. 257, 1958-1964 (1982), Journal of Molecular Biology, vol. 166, 477-535 (1983), Journal of Biological Chemistry, vol. 264, 6427-6437 (1989), and Journal of Biological Chemistry, vol. 264, 4255-4263 (1989). They are the sequences of DNA polymerases from Escherichia coli, T7 phage, Thermus aquaticus, and Streptococcus pneumoniae, respectively.

The sequences shown as SEQ ID Nos. 1 and 2 in the sequence listing are the sequences of mixed primers for use in the PCR that were designed by the inventors on the basis of conserved sequences in Pol I type DNA polymerases. The mixed primer shown as SEQ ID No. 1 corresponds to a region found by the inventors to be conserved in Pol I type DNA polymerases; that is, it was designed on the basis of the amino acid sequences shown as SEQ ID Nos. 3-6 in the sequence listing. The mixed primer shown as SEQ ID No. 2 corresponds to another conserved region of Pol I type DNA polymerases; that is, it was designed on the basis of the amino acid sequences shown as SEQ ID Nos. 7-10 in the sequence listing. This pair of primers can be used to amplify efficiently the Pol I type DNA polymerase gene from, for example, Escherichia coli, T7 phage, Thermus aquaticus, and Streptococcus pneumoniae. It is possible to use said pair of primers as primers in the PCR done to clone the Pol I type DNA polymerase gene. The primers that can be used in this invention can be any primers that can hybridize with the conserved sequences of said genes and amplify said genes with efficiency, any primers derived from the mixed primers described above, any primers designed on the basis of other conserved sequences, or any combination of primers designed from conserved sequences and vectors.
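To illustrate what a mixed primer represents, the following Python sketch expands the degenerate sequence given for SEQ ID No. 1 in the sequence listing (GAYCCHAACYTSCARAAYATHCC) into the individual oligonucleotides it stands for, using the standard IUPAC nucleotide ambiguity codes. The function names are illustrative only and are not part of the invention.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}

def degeneracy(primer):
    """Number of distinct oligonucleotides encoded by a mixed primer."""
    count = 1
    for base in primer:
        count *= len(IUPAC[base])
    return count

def expand(primer):
    """List every individual sequence contained in a mixed primer."""
    choices = (IUPAC[base] for base in primer)
    return ["".join(combo) for combo in product(*choices)]

# SEQ ID No. 1 as written in the sequence listing.
SEQ_ID_1 = "GAYCCHAACYTSCARAAYATHCC"

print(len(SEQ_ID_1), degeneracy(SEQ_ID_1))  # prints: 23 288
```

Under these codes the 23-base primer is a mixture of 288 sequences (five two-fold positions and two three-fold positions), which is why it can anneal to the corresponding region of genes from different organisms.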

Cloning of the genes for Pol I type DNA polymerases, transformation of the E. coli host strain by the plasmid containing the genes for the polymerases, and the purification of the polymerases can be performed by the following steps, given as an example.

1. Chromosomal DNA is isolated from cells having any Pol I type DNA polymerase.

2. Oligonucleotide primers for use in Pol I type DNA polymerase gene amplification shown as SEQ ID Nos. 1 and 2 in the sequence listing, which sequences are based on the region coding for DNA polymerase, are prepared, and the polymerase chain reaction is performed with the DNA obtained in step 1 above as the template.

3. The DNA obtained in step 1 above is cleaved with suitable restriction enzymes, the DNA fragments amplified in step 2 are used as probes to screen the restriction fragments obtained, and the desired DNA fragments are obtained.

4. Vectors are cleaved with appropriate restriction enzymes, and the DNA obtained in step 3 is ligated into the cleaved site.

5. Vectors with the ligated DNA fragments are introduced into host cells, and transformants that contained the desired DNA fragments are selected.

6. Plasmids are isolated from the transformants produced in step 5, the desired DNA fragments are removed, if necessary, and on the basis of the restriction map, the desired gene is reconstructed in its entirety as a continuous genomic fragment, which is ligated into an expression vector as summarized in step 4.

7. Expression vectors carrying the desired DNA fragment are introduced into host cells as described in step 5 to give transformants.

8. The transformants obtained in step 7 are cultured, and DNA polymerase is produced in the E. coli cells.

9. Exonuclease III is used, if necessary, to produce expression vectors for a polymerase in which the 5'→3'-exonuclease coding region is missing from the region coding for the entire DNA polymerase.

10. The expression vectors obtained in step 9 are introduced into host cells to produce transformants, and the transformants produce mutant DNA polymerase.

11. The transformant obtained in step 10 is cultured and the mutant DNA polymerase is purified from the cultured cells.

The bacterial strain that is used in this invention can be any bacterial strain that produces DNA polymerase, such as, for example, Bacillus caldotenax YT-G (Deutsche Sammlung von Mikroorganismen accession number DSM406).

Below is the explanation of this invention using B. caldotenax YT-G as one example.

DNA from the strain used to produce the desired DNA, B. caldotenax YT-G (DSM406), is extracted from a bacterial culture that has been cultivated with shaking at 70° C. Extraction, purification, cleavage with restriction enzymes, and the like can be done by any of the published methods, such as those published in Molecular cloning: A laboratory manual by T. Maniatis et al., on pages 75-178 (Cold Spring Harbor Press, 1982).

The inventors of this invention used as primers the two oligonucleotides with the SEQ ID Nos. 1 and 2 in the sequence listing, which are based on the common amino acid sequences found by comparison. The inventors used DNA from B. caldotenax as the template in the PCR to amplify specific DNA fragments, and found that the amino acid sequence deduced from the base sequence of the DNA fragments obtained was very similar to that of other known DNA polymerases. Said DNA fragments can be used as probes in hybridization to select the desired DNA. The hybridization method used for selection can be any of the published methods, such as that on page 309 of the book mentioned above, Molecular cloning; A laboratory manual.

By Southern hybridization, the gene for the desired DNA polymerase can be located in restriction fragments from B. caldotenax, and the selected restriction enzymes, such as EcoRI, BamHI, HincII, HindIII, XhoI, PstI, and PvuII, can be used to digest B. caldotenax DNA, which is then ligated to plasmid vectors. The plasmid vectors can be any of the known ones, such as, for example, pUC18, pUC19, pTV118N, etc.; the plasmid vectors that can be used are not limited to this list. The procedure used to insert the DNA fragment can be any of the known methods, such as by use of an enzyme reaction with DNA ligase for the insertion.

Next, the recombinant plasmids are introduced into host cells of Escherichia coli, or into any wild strain or mutant strain of host cells that can be transformed; it is preferable to use a mutant defective in the restriction system (restriction⁻, modification⁺). The procedure used for the introduction can be any of the known methods, such as that on page 250 of the book mentioned above, Molecular cloning: A laboratory manual.

In this way, the desired DNA fragment is introduced into host cells, and clones are selected according to the characteristics of the plasmid vector used; for example, when pUC18 is used, colonies are selected for ampicillin resistance. By this method, it is possible to obtain groups of cells in which the desired DNA is cloned. From the colonies obtained, clones that have the desired fragment can be selected. Depending on the vector used, the selection is done by colony hybridization or by plaque hybridization, for which any of the published methods can be used.

Three clones were selected and analyzed by digestion with restriction enzymes, which showed that each carried a HincII fragment, a HindIII fragment, or an XhoI fragment. On the basis of these findings, the three fragments were rejoined in the test tube to give one continuous DNA fragment, which was ligated with the expression vector pTV118N to give the desired clone. The plasmid produced in this way was designated pUI101.

Cells of E. coli containing plasmid pUI101 were cultured, and a crude extract of the harvested cells was obtained.

The extract was treated at 60° C. for 20 minutes, after which heat treatment DNA polymerase activity was still found, although a cell extract from E. coli with the expression vector alone (without the desired DNA fragment) had no such activity. This showed that the bacterial cells carrying pUI101 produced a heat-resistant DNA polymerase, and that the gene for the coding of this enzyme was in fact expressed in the cells of E. coli.

The construction of plasmid pUI101 was as shown in FIG. 1. The gene that coded for DNA polymerase was in an NcoI fragment of about 3.5 kilobases in plasmid pUI101, and the restriction map of said NcoI fragment is shown in FIG. 2. Its base sequence is shown as SEQ ID No. 11 in the sequence listing. The cells of E. coli that grew best as host cells when transformed with pUI101 were E. coli HB101, and the transformed cells were designated Escherichia coli HB101/pUI101, and deposited as FERM BP-3721 at the Fermentation Research Institute, Agency of Industrial Science and Technology, Japan.

When E. coli cells carrying pUI101 are cultured, it is possible to obtain heat-resistant DNA polymerase from the cultured cells, which express a large amount of such heat-resistant DNA polymerase. The method for the purification of the DNA polymerase can be, for example, sonication of the cultured cells, heat-treatment of the sonicated suspension, column chromatography on DEAE-cellulose, and column chromatography on phosphocellulose, which gives a single band of DNA polymerase on SDS-polyacrylamide gel electrophoresis (SDS-PAGE).

The DNA polymerase obtained is a polypeptide that migrates at the position corresponding to a molecular weight of 100,000 on SDS-PAGE. In addition to its DNA synthetic activity, it has 3'→5'-exonuclease activity and 5'→3'-exonuclease activity.

The amino acid sequence of the purified protein obtained was analyzed, and it was possible to identify the N-terminal amino acid sequence. The sequence is shown as SEQ ID No.12 of the sequence listing. This amino acid sequence was found in the translational frame of the NcoI fragment mentioned above, the sequence of which is SEQ ID No. 13 of the sequence listing. The structural gene of the DNA polymerase of this invention was identified, and its entire amino acid sequence was found and is shown as SEQ ID No. 14 in the sequence listing.

When DNA polymerase from the E. coli transformants was studied, it seemed that the 5'→3'-exonuclease activity was present in a domain at the amino terminus of the polypeptide. Plasmids were therefore prepared in which a defined portion of the B. caldotenax DNA fragment in pUI101 was missing, and these were used to transform E. coli; from the transformants, clones that still had DNA synthetic activity but lacked 5'→3'-exonuclease activity could be selected. To prepare the deletion plasmids, the method of Henikoff published in Gene, vol. 28, 351-359 (1984) can be used. The plasmid selected was designated pUI205, and used to transform E. coli cells, which were deposited as Escherichia coli HB101/pUI205 (FERM BP-3720) at the Fermentation Research Institute, Agency of Industrial Science and Technology, Japan.

E. coli cells carrying pUI205 can be cultured, and heat-resistant DNA polymerase can be obtained from the cultured cells, which express a large amount of such heat-resistant DNA polymerase. It is possible to purify the DNA polymerase from the cultured cells by the methods described above or the like until the enzyme gives a single band on SDS-PAGE.

By amino acid analysis of the purified protein, it is possible to identify the N-terminal amino acid sequence. This sequence is shown as SEQ ID No. 15 in the sequence listing. This sequence is lacking the 284 amino acids from Met 1 to Lys 284 of SEQ ID No. 14 of the sequence listing. The entire amino acid sequence of the gene that codes for this mutant form of DNA polymerase has been identified. SEQ ID No. 16 of the sequence listing is the base sequence coding for the mutant DNA polymerase, and SEQ ID No. 17 of the sequence listing is the amino acid sequence of the mutant form of the enzyme.

It is possible to prepare the heat-resistant enzymes on an industrial scale, because by the cultivation of E. coli HB101/pUI101 or E. coli HB101/pUI205, 1 ml of culture broth gave 127 units or 212 units of DNA polymerase activity (the non-mutant form and the mutant form, respectively).

As described above in detail, with this invention, the following usual steps for cloning of the gene for Pol I type DNA polymerase are not needed:

(a) checking for the production of Pol I type DNA polymerase;

(b) isolation of the enzyme;

(c) identification of a partial amino acid sequence;

(d) synthesis of a probe from the identified amino acid sequence.

Without use of these steps, the gene for the desired Pol I type DNA polymerase can be simply and effectively obtained.

FIG. 1 is a diagram of the procedure used to construct pUI101.

FIG. 2 is a restriction map of the gene for DNA polymerase cloned in pUI101.

FIG. 3 is a restriction map of the gene for DNA polymerase cloned in pUI205.

EXAMPLES

Below, this invention will be explained with reference to examples, but the invention is not to be taken to be limited to these examples.

Example 1

Two oligodeoxyribonucleotides having the sequences shown as SEQ ID Nos. 1 and 2 in the sequence listing were synthesized and purified for use as PCR primers. In the next step, PCRs were done using genomic DNAs prepared from E. coli, T7 phage, T. aquaticus, T. thermophilus, S. pneumoniae, Bacillus subtilis, B. stearothermophilus, B. caldolyticus, Lactobacillus bulgaricus, L. homohiochii, and L. heterohiochii as templates. Each reaction mixture included 100 pmol of each oligonucleotide primer and 1 ng of the genomic DNA in a volume of 100 μl. Thirty cycles of the PCR were carried out, with each cycle consisting of 30 sec at 95° C., 1 min at 55° C., and 2 min at 72° C. Then 5 μl portions of each reaction mixture were put directly on a 0.8% agarose gel. All of these PCRs gave a product of about 600 bp. These fragments were cloned into the SmaI site of an M13 vector, and their nucleotide sequences were determined.

Example 2

2-1. Preparation of chromosomal DNA from B. caldotenax

First, B. caldotenax YT-G was grown in 125 ml of L medium (10 g/l Bactotryptone, 5 g/l yeast extract, and 5 g/l NaCl, at pH 7.2) at 65° C. overnight with shaking, and the bacterial cells were harvested and suspended in 4 ml of 25% sucrose containing 0.05M Tris-HCl (pH 8.0). To the suspension, 800 μl of lysozyme (5 mg/ml) was added, and the mixture was left at 20° C. for 1 hour. Then 24 ml of SET solution (20 mM Tris-HCl, pH 8.0, 1 mM EDTA, and 150 mM NaCl) was added, after which 4 ml of 5% SDS and 400 μl of proteinase K (10 mg/ml) were added, and the mixture was kept at 37° C. for 1 hour. After phenol extraction and chloroform extraction, ethanol was added to precipitate long fragments of DNA, which were removed from the suspension with a sterilized toothpick. By this procedure, 3.1 mg of DNA was obtained.

2-2. Amplification of specific DNA by the PCR

With 100 pmol of each of the two oligodeoxyribonucleotides shown in the sequence listing as SEQ ID Nos. 1 and 2, and with 1 ng of DNA from B. caldotenax in a total volume of 100 μl, 30 cycles of the PCR were carried out, with each cycle consisting of 30 seconds at 95° C., 1 minute at 55° C., and 2 minutes at 72° C. Then 5 μl of the reaction mixture was sampled and analyzed by agarose gel electrophoresis. The analysis showed that a DNA fragment 600 base pairs long had been amplified specifically. This DNA fragment was ligated into M13mp18, the phage vector having been cleaved with SmaI. The base sequence was found by the dideoxy method.
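The reaction conditions above imply a primer concentration and a total cycling time that can be worked out with simple arithmetic. The Python sketch below does this; the helper names are illustrative, and the time estimate assumes that the time needed to change temperature between steps is ignored, which is not stated in this example.

```python
# Cycling conditions as given in Example 2-2.
CYCLES = 30
STEP_SECONDS = {
    "denaturation, 95 C": 30,
    "annealing, 55 C": 60,
    "extension, 72 C": 120,
}

def total_cycling_minutes(cycles, step_seconds):
    """Total time at the programmed temperatures (ramping ignored)."""
    return cycles * sum(step_seconds.values()) / 60

def concentration_micromolar(amount_pmol, volume_ul):
    """pmol per microlitre is numerically equal to micromol per litre."""
    return amount_pmol / volume_ul

primer_uM = concentration_micromolar(100, 100)    # each primer
template_pg_per_ul = 1 * 1000 / 100               # 1 ng in 100 microlitres
minutes = total_cycling_minutes(CYCLES, STEP_SECONDS)

print(primer_uM, template_pg_per_ul, minutes)  # prints: 1.0 10.0 105.0
```

Each primer is therefore present at 1 μM, the template at 10 pg/μl, and the 30 cycles take 105 minutes of programmed time.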

2-3. Detection of the desired gene by the genomic Southern method.

Five micrograms of DNA from B. caldotenax was digested with each of the following enzymes: EcoRI, BamHI, HindIII, HincII, XhoI, PstI, and PvuII. The digests were separated by agarose gel electrophoresis. The DNA in the gel was transferred to a nylon membrane, and then hybridization was done with the 600-base-pair DNA fragment obtained by the PCR as described above as the probe. The probe was labelled radioactively by the random priming method. Hybridization was done in 6× SSC that contained 1% SDS, 5× Denhardt's solution, and 10 μg/ml calf thymus DNA at 65° C. for 5 hours. Then the membrane was washed in 1× SSC containing 0.1% SDS for 1 hour, and used to expose X-ray film, giving an autoradiogram.
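The SSC strengths used for hybridization and washing can be converted to molar concentrations as sketched below in Python. The composition of 1× SSC (150 mM NaCl, 15 mM sodium citrate) and the use of a 20× stock are the customary laboratory definitions and are assumptions here, since this example does not state them; the 100 ml volume is a hypothetical figure.

```python
# Customary composition of 1x SSC (assumed, not stated in the example).
SSC_1X_MM = {"NaCl": 150.0, "sodium citrate": 15.0}

def ssc_composition(strength):
    """Millimolar composition of SSC at the given strength (e.g. 6 for 6x)."""
    return {salt: mm * strength for salt, mm in SSC_1X_MM.items()}

def stock_volume_ml(final_volume_ml, strength, stock_strength=20):
    """Volume of concentrated stock needed for the final volume."""
    return final_volume_ml * strength / stock_strength

hybridization = ssc_composition(6)   # 6x SSC used for hybridization
wash = ssc_composition(1)            # 1x SSC used for washing

print(hybridization)                 # {'NaCl': 900.0, 'sodium citrate': 90.0}
print(stock_volume_ml(100, 6))       # 30.0 ml of 20x stock for 100 ml of 6x SSC
```

The wash in 1× SSC is thus carried out at one-sixth of the salt concentration used for hybridization.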

2-4. Cloning of DNA fragments containing the gene for DNA polymerase

To clone the DNA fragments found to be positive in the genomic Southern analysis, the 2.40-kb HindIII fragment, the 1.45-kb HincII fragment, and the 2.1-kb XhoI fragment of DNA from B. caldotenax were prepared by digestion of 100 μg of the DNA with the appropriate restriction enzyme (HindIII, HincII, or XhoI), and the DNAs of the desired sizes were isolated by electrophoresis on agarose gel and recovered by adsorption onto glass beads. Plasmid pTV118N was linearized with each of the same three enzymes, and alkaline phosphatase was used to remove the terminal phosphate groups. Then the DNA was ligated to the vector with DNA ligase, and the vectors were introduced into cells of E. coli JM109. The transformants obtained were then screened by colony hybridization for selection of the desired clones.

From 50 to 200 colonies of recombinants grown on a nylon membrane were denatured in a mixture of 0.5N sodium hydroxide and 1.5M sodium chloride, and were then neutralized with a mixture of 1M Tris-HCl and 1.5M sodium chloride (pH 7.0). DNA was fixed on the membrane with ultraviolet light. The preparation of the probe and the hybridization conditions were the same as those used in genomic Southern analysis.

2-5. Restriction analysis of cloned fragments and reconstitution of the DNA polymerase gene

From the results of restriction mapping of the three DNA fragments obtained, it was found that the fragments overlapped when arranged in the order of the HincII fragment, the HindIII fragment, and the XhoI fragment. The fragments formed a continuous part of the chromosomal DNA. Restriction sites were selected so that unneeded portions would be eliminated as far as possible, and the three DNA fragments were ligated with the vector pTV118N at the same time, as shown in FIG. 1. In this way, a plasmid that contained about 3.5 kb of DNA fragment that included the gene for DNA polymerase was constructed and designated pUI101.
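The idea of joining overlapping cloned fragments into one continuous stretch can be sketched in Python as follows. The three short sequences are toy examples invented for illustration (they are not the B. caldotenax sequences, and the actual reconstruction was done by ligation at selected restriction sites, not by computation); the function names are likewise hypothetical.

```python
def overlap(left, right, min_len=3):
    """Length of the longest suffix of `left` that is a prefix of `right`."""
    for n in range(min(len(left), len(right)), min_len - 1, -1):
        if left.endswith(right[:n]):
            return n
    return 0

def merge_in_order(fragments, min_len=3):
    """Join fragments, given in map order, through their shared ends."""
    contig = fragments[0]
    for fragment in fragments[1:]:
        n = overlap(contig, fragment, min_len)
        if n == 0:
            raise ValueError("adjacent fragments do not overlap")
        contig += fragment[n:]
    return contig

# Toy fragments in the order HincII, HindIII, XhoI (hypothetical sequences).
hincII_fragment = "ATGGCGTTAACC"
hindIII_fragment = "TTAACCGGAAGCTTCA"
xhoI_fragment = "AGCTTCACTCGAGTT"

contig = merge_in_order([hincII_fragment, hindIII_fragment, xhoI_fragment])
print(contig)  # ATGGCGTTAACCGGAAGCTTCACTCGAGTT
```

In this toy case the first pair shares six bases and the second pair seven, so the three fragments collapse into a single 30-base sequence.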

FIG. 1 shows the construction of pUI101. FIG. 2 shows the restriction map of the NcoI DNA fragment, which included the gene for DNA polymerase, that was cloned in pUI101.

Next, said plasmid was used to transform cells of E. coli HB101, and the transformants were deposited as Escherichia coli HB101/pUI101 (FERM BP-3721).

2-6. Culture of transformants and preparation of a crude extract

Cells of E. coli HB101 (FERM BP-3721), which contained the recombinant plasmid pUI101 described above, were cultured at 37° C. in 5 ml of L medium that contained 100 μg/ml ampicillin. When the absorbance of the culture broth reached 0.6 (A₆₀₀), isopropyl-β-D-thiogalactoside (IPTG), the derivative of the substrates for β-galactosidase, was added to the culture, and culture was continued for 15 hours more.

Cells in 1 ml of culture broth were harvested and washed in 50 mM Tris-HCl (pH 8.0) containing 25% sucrose. The cells were suspended again in the same solution, to which the same volume of lysis solution (50 mM Tris-HCl, pH 7.5, 25% sucrose, 60 mM spermidine-HCl, 20 mM sodium chloride, and 12 mM dithiothreitol) was added, and the mixture was left for 45 minutes at 4° C. Then 20 μl of 5% (w/v) Triton X-100 was added to the mixture, which was left for 5 minutes at 37° C. The supernatant obtained by centrifugation was incubated for 20 minutes at 60° C., and centrifuged again. The supernatant obtained in this step was the crude extract.

2-7. Assay of DNA polymerase activity

A reaction mixture was prepared that contained 67 mM potassium phosphate, pH 7.4, 6.7 mM magnesium chloride, 1 mM 2-mercaptoethanol, 20 μM activated DNA, 33 μM each dATP, dCTP, dGTP, and TTP, and 60 nM [³H]TTP. An appropriate amount of the crude extract was added to 150 μl of this solution, and reaction was allowed to proceed for 5 minutes at 60° C., after which the reaction was stopped by the addition of 1 ml of a mixture of 50 mM pyrophosphoric acid and 10% trichloroacetic acid. The reaction vessel was placed in ice for 5 minutes, and the entire reaction mixture was filtered on a glass filter under reduced pressure. The filter was washed several times with 10% trichloroacetic acid, and then with 70% ethanol before being dried and put in a liquid scintillation counter for the counting of radioactivity. There were 127 units of DNA polymerase activity in 1 ml of culture broth.

2-8. Production of heat-resistant DNA polymerase by E. coli cells carrying plasmid pUI101.

From 2.2 g of cells of E. coli HB101/pUI101, 20 ml of crude extract was obtained by the methods described in Example 2-6. This extract was incubated at 60° C. for 30 minutes, and the protein denatured by heat was removed by centrifugation for 10 minutes at 10000×g. To the supernatant, ammonium sulfate was added, and the fraction that precipitated at 30-80% saturation was dialyzed against DE buffer (50 mM Tris-HCl, pH 7.0, 0.2 mM 2-mercaptoethanol, 10% glycerol, and 4 μM phenylmethanesulfonyl fluoride). Then the same buffer was used to equilibrate a column of 15 ml of DE52 (Whatman) and the extract was eluted from the column with a linear gradient of NaCl concentrations from 0 to 300 mM. Then the DNA polymerase activity was assayed by the method of Example 2-7. The fractions of DE buffer that contained activity were pooled and put on a column of 15 ml of P11 (Whatman) equilibrated with DE buffer. Then elution was done with a linear gradient of NaCl concentrations from 0 mM to 300 mM, and the fractions with activity were pooled. The P11 fractions were analyzed by SDS-PAGE, and a single band at the molecular weight of 100,000 was found.

2-9. Identification of the N-terminal amino acid sequence by an amino acid analyzer

The DNA polymerase obtained in example 2-8 was analyzed with an amino acid analyzer, and the amino acid sequence of the N-terminal region was that shown as SEQ ID No. 12 in the sequence listing.

Example 3

3-1. Preparation of plasmids with a regional deletion

To eliminate 5'→3'-exonuclease activity, which was deduced to be present in a domain on the amino-terminal side of the DNA polymerase protein, plasmids were prepared with regional deletions from the 5'-end of the gene. So that the method that uses exonuclease III could be employed, first, the NcoI fragment about 3.5 kb long carried by pUI101 was cut out and made blunt-ended, after which it was ligated into the HincII site of pTV118N. Then double digestion was done with KpnI, which leaves 3'-protruding ends, and with XbaI, which leaves 5'-protruding ends, and exonuclease III was used to digest the DNA only from the 5'-protruding ends, because this enzyme does not attack 3'-protruding ends. Mung bean nuclease was used to make blunt ends, and then DNA ligase was used to restore the original circular shape. By adjustment of the time of the exonuclease reaction, mutants with deletions of a variety of sizes could be obtained. By ligation with an NcoI linker before recircularization, the initiation codon could be inserted in an appropriate location, and depending on the location of the deletion, the reading frame came to be that of the DNA polymerase gene in one-third of the cases (that is, the probability that the reading frames matched was one-third).
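The one-third probability follows from the fact that the reading frame is preserved only when the number of bases removed from the start of the coding region is a multiple of three. The Python sketch below enumerates this under the simplifying assumption that deletion endpoints are spread evenly over a range of positions; the coordinates are hypothetical and the function names are illustrative.

```python
from fractions import Fraction

def in_frame(deletion_end, orf_start=0):
    """True when the first retained base lies on a codon boundary."""
    return (deletion_end - orf_start) % 3 == 0

def in_frame_fraction(endpoints, orf_start=0):
    """Fraction of deletion endpoints that keep the original reading frame."""
    endpoints = list(endpoints)
    hits = sum(in_frame(end, orf_start) for end in endpoints)
    return Fraction(hits, len(endpoints))

# Hypothetical, evenly distributed deletion endpoints (in bases from the
# first base of the open reading frame).
fraction = in_frame_fraction(range(300, 900))
print(fraction)  # 1/3
```

Two of every three deletion plasmids are therefore expected to be out of frame, which is why a number of clones had to be assayed to find an active one.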

The plasmids constructed as described above were introduced into E. coli cells; of the transformants obtained, 20 clones were selected at random, and crude extracts were prepared from them and assayed for DNA polymerase activity. For the clones that had DNA synthetic activity at 60° C., the base sequences were analyzed. One of the clones with activity was selected and the plasmid it carried was designated pUI205. The 2-kb DNA fragment shown in FIG. 3 was inserted into pUI205. Next, the plasmid was used to transform E. coli HB101 cells, and these were designated Escherichia coli HB101/pUI205 cells (FERM BP-3720).

FIG. 3 is a restriction map of the gene for DNA polymerase that was cloned into pUI205.

3-2. Culture of recombinants and preparation of crude extract

The recombinants mentioned above, FERM BP-3720, were cultured and a crude extract was prepared from the cultured cells by the methods of Example 2-6.

3-3. Assay of DNA polymerase activity.

The crude extract obtained above was assayed for DNA polymerase activity by the methods of Example 2-7. The crude extract had 212 units of DNA polymerase activity in 1 ml of culture broth.

3-4. Assay of 5'→3'-exonuclease activity

As a substrate, plasmid pBR322 was cleaved with PvuII, and the fragment 322 base pairs long was treated with [γ-³²P]ATP and polynucleotide kinase to phosphorylate it. Then the enzyme standard prepared in Example 3-2 was mixed with a solution containing 67 mM potassium phosphate (pH 7.4), 6.7 mM magnesium chloride, and 1 mM 2-mercaptoethanol and with the substrate, and reaction was allowed to occur for 5 minutes at 60° C. Then the substrate DNA was made to precipitate by the addition of ethanol.

The radioactivity in the supernatant was counted on a liquid scintillation counter, and the amount of product produced by exonuclease was calculated. In the enzyme used, 5'→3'-exonuclease activity was not detected.

3-5. Production of heat-resistant DNA polymerase by E. coli cells carrying plasmid pUI205

By the methods of Example 2-8, standard enzyme was obtained from cells of E. coli HB101/pUI205. The obtained enzyme was analyzed by SDS-PAGE, and it gave a single band at the molecular weight of 67,000.
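As a rough plausibility check, the size of this band can be compared with what would be expected when 284 amino acids (the size of the N-terminal deletion reported in the description) are removed from a polypeptide of molecular weight 100,000. The Python sketch below assumes an average residue mass of about 110, which is a common rule of thumb and not a value taken from this specification.

```python
# Rule-of-thumb average mass per amino acid residue (assumption).
AVERAGE_RESIDUE_MASS = 110

def truncated_mass(full_mass, residues_removed,
                   residue_mass=AVERAGE_RESIDUE_MASS):
    """Estimated mass after removing a number of residues."""
    return full_mass - residues_removed * residue_mass

estimate = truncated_mass(100_000, 284)
observed = 67_000
relative_difference = abs(estimate - observed) / observed

print(estimate)                       # 68760
print(round(relative_difference, 3))  # 0.026
```

The estimate of about 69,000 is within a few percent of the band observed at 67,000, which is well inside the accuracy usually expected of SDS-PAGE.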

3-6. Sequencing of the N-terminal sequence by an amino acid analyzer

By the methods of Example 2-9, the N-terminal amino acid sequence of the standard enzyme was found to be that shown as SEQ ID No. 15 in the sequence listing.

Example 4

4-1. Determination of the base sequence of the chromosomal DNA from B. caldotenax including the structural gene for DNA polymerase

By the methods of Example 3-1, a number of deletion mutants of a variety of sizes were prepared, and their base sequences were identified by the dideoxy method. The data obtained were analyzed and the base sequence of the entire NcoI-NcoI fragment obtained from pUI101 was found to be that of SEQ ID No. 11 in the sequence listing. The sequence of pUI205 was found to be that of base numbers 1032-3252 of the NcoI fragment of pUI101 shown as SEQ ID No. 11 in the sequence listing.
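The length of the region retained in pUI205 follows directly from the base numbers quoted above. The short Python sketch below shows the calculation; the function name is illustrative.

```python
def span_length(first_base, last_base):
    """Number of bases in an inclusive range of base numbers."""
    return last_base - first_base + 1

# Base numbers 1032-3252 of the NcoI fragment (SEQ ID No. 11).
insert_length = span_length(1032, 3252)
print(insert_length)  # 2221
```

A length of 2221 bases agrees with the description of the insert in pUI205 as a DNA fragment of about 2 kb.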

Thus, based on the N-terminal amino acid sequence identified as described in Example 2-9, and Example 3-6, and the base sequence identified in this example, we identified the structural gene of the DNA polymerase of this invention and the amino acid sequence of the DNA polymerase of this invention.

As explained above in detail, this invention provides a simple and efficient method for cloning genes that code for novel Pol I type DNA polymerases and provides said genes. This invention also provides a method for production of a Pol I type DNA polymerase, which is useful as a reagent in genetic engineering research.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(iii) NUMBER OF SEQUENCES: 17

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 23 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL:
(iv) ANTI-SENSE: No
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:
    (A) ORGANISM:
    (B) STRAIN:
    (C) INDIVIDUAL ISOLATE:
    (D) DEVELOPMENTAL STAGE:
    (E) HAPLOTYPE:
    (F) TISSUE TYPE:
    (G) CELL TYPE:
    (H) CELL LINE:
    (I) ORGANELLE:
(vii) IMMEDIATE SOURCE:
    (A) LIBRARY:
    (B) CLONE:
(viii) POSITION IN GENOME:
    (A) CHROMOSOME/SEGMENT:
    (B) MAP POSITION:
    (C) UNITS:
(ix) FEATURE:
    (A) NAME/KEY:
    (B) LOCATION:
    (C) IDENTIFICATION METHOD:
    (D) OTHER INFORMATION:
(x) PUBLICATION INFORMATION:
    (A) AUTHORS:
    (B) TITLE:
    (C) JOURNAL:
    (D) VOLUME:
    (E) ISSUE:
    (F) PAGES:
    (G) DATE:
    (H) DOCUMENT NUMBER:
    (I) FILING DATE:
    (J) PUBLICATION DATE:
    (K) RELEVANT RESIDUES IN SEQ ID NO:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GAYCCHAACYTSCARAAYATHCC 23

(2) INFORMATION FOR SEQ ID NO:2:
                                 (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: other nucleic acid                                         (iii) HYPOTHETICAL:                                                            (iv) ANTI-SENSE: Yes                                                           (v) FRAGMENT TYPE:                                                             (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM:                                                                  (B) STRAIN:                                                                    (C) INDIVIDUAL ISOLATE:                                                        (D) DEVELOPMENTAL STAGE:                                                       (E) HAPLOTYPE:                                                                 (F) TISSUE TYPE:                                                               (G) CELL TYPE:                                                                 (H) CELL LINE:                                                                 (I) ORGANELLE:                                                                 (vii) IMMEDIATE SOURCE:                                                        (A) LIBRARY:                                                                   (B) CLONE:                                                                     (viii) POSITION IN GENOME:                                                     (A) CHROMOSOME/SEGMENT:                                                        (B) MAP POSITION:                                                      
        (C) UNITS:                                                                     (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION:                                                                  (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION:                                                         (x) PUBLICATION INFORMATION:                                                   (A) AUTHORS:                                                                   (B) TITLE:                                                                     (C) JOURNAL:                                                                   (D) VOLUME:                                                                    (E) ISSUE:                                                                     (F) PAGES:                                                                     (G) DATE:                                                                      (H) DOCUMENT NUMBER:                                                           (I) FILING DATE:                                                               (J) PUBLICATION DATE:                                                          (K) RELEVANT RESIDUES IN SEQ ID NO:                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        KASNAKYTCRTCRTGNACYTG21                                                        (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 8 amino acid residues                                              (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: 
linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL:                                                            (iv) ANTI-SENSE:                                                               (v) FRAGMENT TYPE: internal fragment                                           (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Escherichia coli                                                 (B) STRAIN:                                                                    (C) INDIVIDUAL ISOLATE:                                                        (D) DEVELOPMENTAL STAGE:                                                       (E) HAPLOTYPE:                                                                 (F) TISSUE TYPE:                                                               (G) CELL TYPE:                                                                 (H) CELL LINE:                                                                 (I) ORGANELLE:                                                                 (vii) IMMEDIATE SOURCE:                                                        (A) LIBRARY:                                                                   (B) CLONE:                                                                     (viii) POSITION IN GENOME:                                                     (A) CHROMOSOME/SEGMENT:                                                        (B) MAP POSITION:                                                              (C) UNITS:                                                                     (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION:                                                                  (C) IDENTIFICATION METHOD:             
                                        (D) OTHER INFORMATION:                                                         (x) PUBLICATION INFORMATION:                                                   (A) AUTHORS:                                                                   (B) TITLE:                                                                     (C) JOURNAL:                                                                   (D) VOLUME:                                                                    (E) ISSUE:                                                                     (F) PAGES:                                                                     (G) DATE:                                                                      (H) DOCUMENT NUMBER:                                                           (I) FILING DATE:                                                               (J) PUBLICATION DATE:                                                          (K) RELEVANT RESIDUES IN SEQ ID NO:                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        AspProAsnLeuGlnAsnIlePro                                                       15                                                                             (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 8 amino acid residues                                              (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL:                                                            (iv) ANTI-SENSE:                                                
               (v) FRAGMENT TYPE: internal fragment                                           (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: T7 phage                                                         (B) STRAIN:                                                                    (C) INDIVIDUAL ISOLATE:                                                        (D) DEVELOPMENTAL STAGE:                                                       (E) HAPLOTYPE:                                                                 (F) TISSUE TYPE:                                                               (G) CELL TYPE:                                                                 (H) CELL LINE:                                                                 (I) ORGANELLE:                                                                 (vii) IMMEDIATE SOURCE:                                                        (A) LIBRARY:                                                                   (B) CLONE:                                                                     (viii) POSITION IN GENOME:                                                     (A) CHROMOSOME/SEGMENT:                                                        (B) MAP POSITION:                                                              (C) UNITS:                                                                     (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION:                                                                  (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION:                                                         (x) PUBLICATION INFORMATION:                                                   (A) AUTHORS:                                                                   (B) 
TITLE:                                                                     (C) JOURNAL:                                                                   (D) VOLUME:                                                                    (E) ISSUE:                                                                     (F) PAGES:                                                                     (G) DATE:                                                                      (H) DOCUMENT NUMBER:                                                           (I) FILING DATE:                                                               (J) PUBLICATION DATE:                                                          (K) RELEVANT RESIDUES IN SEQ ID NO:                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        PheProAsnLeuAlaGlnIlePro                                                       15                                                                             (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 8 amino acid residues                                              (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL:                                                            (iv) ANTI-SENSE:                                                               (v) FRAGMENT TYPE: internal fragment                                           (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Thermus aquaticus                                                (B) STRAIN:                  
                                                  (C) INDIVIDUAL ISOLATE:                                                        (D) DEVELOPMENTAL STAGE:                                                       (E) HAPLOTYPE:                                                                 (F) TISSUE TYPE:                                                               (G) CELL TYPE:                                                                 (H) CELL LINE:                                                                 (I) ORGANELLE:                                                                 (vii) IMMEDIATE SOURCE:                                                        (A) LIBRARY:                                                                   (B) CLONE:                                                                     (viii) POSITION IN GENOME:                                                     (A) CHROMOSOME/SEGMENT:                                                        (B) MAP POSITION:                                                              (C) UNITS:                                                                     (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION:                                                                  (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION:                                                         (x) PUBLICATION INFORMATION:                                                   (A) AUTHORS:                                                                   (B) TITLE:                                                                     (C) JOURNAL:                                                                   (D) VOLUME:                                                                    (E) ISSUE:                                            
                         (F) PAGES:                                                                     (G) DATE:                                                                      (H) DOCUMENT NUMBER:                                                           (I) FILING DATE:                                                               (J) PUBLICATION DATE:                                                          (K) RELEVANT RESIDUES IN SEQ ID NO:                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        AspProAsnLeuGluAsnIlePro                                                       15                                                                             (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 8 amino acid residues                                              (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL:                                                            (iv) ANTI-SENSE:                                                               (v) FRAGMENT TYPE: internal fragment                                           (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Streptococcus pneumoniae                                         (B) STRAIN:                                                                    (C) INDIVIDUAL ISOLATE:                                                        (D) DEVELOPMENTAL STAGE:                                                       (E) HAPLOTYPE:                                                                 
(F) TISSUE TYPE:                                                               (G) CELL TYPE:                                                                 (H) CELL LINE:                                                                 (I) ORGANELLE:                                                                 (vii) IMMEDIATE SOURCE:                                                        (A) LIBRARY:                                                                   (B) CLONE:                                                                     (viii) POSITION IN GENOME:                                                     (A) CHROMOSOME/SEGMENT:                                                        (B) MAP POSITION:                                                              (C) UNITS:                                                                     (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION:                                                                  (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION:                                                         (x) PUBLICATION INFORMATION:                                                   (A) AUTHORS:                                                                   (B) TITLE:                                                                     (C) JOURNAL:                                                                   (D) VOLUME:                                                                    (E) ISSUE:                                                                     (F) PAGES:                                                                     (G) DATE:                                                                      (H) DOCUMENT NUMBER:                                                           (I) FILING DATE:         
                                                      (J) PUBLICATION DATE:                                                          (K) RELEVANT RESIDUES IN SEQ ID NO:                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        AspProAsnLeuGluAsnIlePro                                                       15                                                                             (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7 amino acid residues                                              (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL:                                                            (iv) ANTI-SENSE:                                                               (v) FRAGMENT TYPE: internal fragment                                           (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Escherichia coli                                                 (B) STRAIN:                                                                    (C) INDIVIDUAL ISOLATE:                                                        (D) DEVELOPMENTAL STAGE:                                                       (E) HAPLOTYPE:                                                                 (F) TISSUE TYPE:                                                               (G) CELL TYPE:                                                                 (H) CELL LINE:                                                                 (I) ORGANELLE:                                    
                             (vii) IMMEDIATE SOURCE:                                                        (A) LIBRARY:                                                                   (B) CLONE:                                                                     (viii) POSITION IN GENOME:                                                     (A) CHROMOSOME/SEGMENT:                                                        (B) MAP POSITION:                                                              (C) UNITS:                                                                     (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION:                                                                  (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION:                                                         (x) PUBLICATION INFORMATION:                                                   (A) AUTHORS:                                                                   (B) TITLE:                                                                     (C) JOURNAL:                                                                   (D) VOLUME:                                                                    (E) ISSUE:                                                                     (F) PAGES:                                                                     (G) DATE:                                                                      (H) DOCUMENT NUMBER:                                                           (I) FILING DATE:                                                               (J) PUBLICATION DATE:                                                          (K) RELEVANT RESIDUES IN SEQ ID NO:                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                    
    GlnValHisAspGluLeuVal                                                          15                                                                             (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7 amino acid residues                                              (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL:                                                            (iv) ANTI-SENSE:                                                               (v) FRAGMENT TYPE: internal fragment                                           (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: T7 phage                                                         (B) STRAIN:                                                                    (C) INDIVIDUAL ISOLATE:                                                        (D) DEVELOPMENTAL STAGE:                                                       (E) HAPLOTYPE:                                                                 (F) TISSUE TYPE:                                                               (G) CELL TYPE:                                                                 (H) CELL LINE:                                                                 (I) ORGANELLE:                                                                 (vii) IMMEDIATE SOURCE:                                                        (A) LIBRARY:                                                                   (B) CLONE:                                                                     (viii) POSITION IN 
GENOME:                                                     (A) CHROMOSOME/SEGMENT:                                                        (B) MAP POSITION:                                                              (C) UNITS:                                                                     (ix) FEATURE:                                                                  (A) NAME/KEY:                                                                  (B) LOCATION:                                                                  (C) IDENTIFICATION METHOD:                                                     (D) OTHER INFORMATION:                                                         (x) PUBLICATION INFORMATION:                                                   (A) AUTHORS:                                                                   (B) TITLE:                                                                     (C) JOURNAL:                                                                   (D) VOLUME:                                                                    (E) ISSUE:                                                                     (F) PAGES:                                                                     (G) DATE:                                                                      (H) DOCUMENT NUMBER:                                                           (I) FILING DATE:                                                               (J) PUBLICATION DATE:                                                          (K) RELEVANT RESIDUES IN SEQ ID NO:                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        TrpValHisAspGluIleGln                                                          15                                                                             (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:               
                                   (A) LENGTH: 7 amino acid residues                                              (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (iii) HYPOTHETICAL:                                                            (iv) ANTI-SENSE:                                                               (v) FRAGMENT TYPE: internal fragment                                           (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Thermus aquaticus                                                (B) STRAIN:                                                                    (C) INDIVIDUAL ISOLATE:                                                        (D) DEVELOPMENTAL STAGE:                                                       (E) HAPLOTYPE:                                                                 (F) TISSUE TYPE:                                                               (G) CELL TYPE:                                                                 (H) CELL LINE:                                                                 (I) ORGANELLE:                                                                 (vii) IMMEDIATE SOURCE:                                                        (A) LIBRARY:                                                                   (B) CLONE:                                                                     (viii) POSITION IN GENOME:                                                     (A) CHROMOSOME/SEGMENT:                                                        (B) MAP POSITION:                                                              (C) UNITS:                                                           
(ix) FEATURE:
(A) NAME/KEY:
(B) LOCATION:
(C) IDENTIFICATION METHOD:
(D) OTHER INFORMATION:
(x) PUBLICATION INFORMATION:
(A) AUTHORS:
(B) TITLE:
(C) JOURNAL:
(D) VOLUME:
(E) ISSUE:
(F) PAGES:
(G) DATE:
(H) DOCUMENT NUMBER:
(I) FILING DATE:
(J) PUBLICATION DATE:
(K) RELEVANT RESIDUES IN SEQ ID NO:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
Gln Val His Asp Glu Leu Val
1 5
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acid residues
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL:
(iv) ANTI-SENSE:
(v) FRAGMENT TYPE: internal fragment
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Streptococcus pneumoniae
(B) STRAIN:
(C) INDIVIDUAL ISOLATE:
(D) DEVELOPMENTAL STAGE:
(E) HAPLOTYPE:
(F) TISSUE TYPE:
(G) CELL TYPE:
(H) CELL LINE:
(I) ORGANELLE:
(vii) IMMEDIATE SOURCE:
(A) LIBRARY:
(B) CLONE:
(viii) POSITION IN GENOME:
(A) CHROMOSOME/SEGMENT:
(B) MAP POSITION:
(C) UNITS:
(ix) FEATURE:
(A) NAME/KEY:
(B) LOCATION:
(C) IDENTIFICATION METHOD:
(D) OTHER INFORMATION:
(x) PUBLICATION INFORMATION:
(A) AUTHORS:
(B) TITLE:
(C) JOURNAL:
(D) VOLUME:
(E) ISSUE:
(F) PAGES:
(G) DATE:
(H) DOCUMENT NUMBER:
(I) FILING DATE:
(J) PUBLICATION DATE:
(K) RELEVANT RESIDUES IN SEQ ID NO:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
Glu Val His Asp Glu Ile Val
1 5
(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3252 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: genomic DNA
(iii) HYPOTHETICAL:
(iv) ANTI-SENSE:
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus caldotenax
(B) STRAIN: YT-G(DSM406)
(C) INDIVIDUAL ISOLATE:
(D) DEVELOPMENTAL STAGE:
(E) HAPLOTYPE:
(F) TISSUE TYPE:
(G) CELL TYPE:
(H) CELL LINE:
(I) ORGANELLE:
(vii) IMMEDIATE SOURCE:
(A) LIBRARY:
(B) CLONE:
(viii) POSITION IN GENOME:
(A) CHROMOSOME/SEGMENT:
(B) MAP POSITION:
(C) UNITS:
(ix) FEATURE:
(A) NAME/KEY:
(B) LOCATION:
(C) IDENTIFICATION METHOD:
(D) OTHER INFORMATION:
(x) PUBLICATION INFORMATION:
(A) AUTHORS:
(B) TITLE:
(C) JOURNAL:
(D) VOLUME:
(E) ISSUE:
(F) PAGES:
(G) DATE:
(H) DOCUMENT NUMBER:
(I) FILING DATE:
(J) PUBLICATION DATE:
(K) RELEVANT RESIDUES IN SEQ ID NO:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
CCATGGATATATTACCGTAGCGAGCAAAGTGGGGCGCGGCACCGTGTTCACGATCCATTT 60
TCCAAAGCCGGGGCGGTAGCCGGCTTCTTTTTATCATCTCCAACTGAGAAGCCTGCCATT 120
TTTCAGCGTGACGTGAGCACGGGATGAATCCGCGCCTCCCATCATGTTGGGAGAGCGTTC 180
AAGGCAAGCCGCAGGCATGGTACAATAGGACAAGGAAGCATCCGAGGAGGGATGAGA 237
TTGAAAAAAAAGCTTGTTTTAATCGACGGCAGCAGCGTGGCGTAC 282
CGCGCCTTTTTCGCCTTGCCGCTTTTGCATAACGACAAAGGCATC 327
CATACGAACGCCGTCTACGGGTTTACGATGATGTTGAATAAAATT 372
TTGGCGGAAGAAGAGCCAACTCATATGCTTGTCGCGTTTGACGCC 417
GGGAAAACGACGTTCCGGCATGAAGCGTTTCAAGAGTATAAAGGT 462
GGGCGCCAGCAGACGCCACCGGAGCTGTCGGAGCAGTTTCCGCTG 507
TTGCGCGAGCTGCTGAGGGCGTATCGCATCCCCGCCTATGAACTC 552
GAGAACTACGAAGCGGACGATATTATCGGAACGCTTGCCGCCCGC 597
GCTGAGCAGGAAGGGTTTGAGGTGAAAGTCATTTCCGGCGACCGC 642
GATCTGACCCAGCTCGCCTCCCCCCATGTGACGGTGGACATTACG 687
AAAAAAGGGATTACCGATATCGAACCGTACACGCCGGAGGCGGTC 732
CGCGAAAAATACGGCTTAACTCCGGAACAAATCGTTGATTTGAAA 777
GGATTGATGGGCGACAAATCGGACAACATTCCCGGAGTGCCGGGC 822
ATCGGGGAAAAGACGGCGGTCAAGCTGCTCAGGCAATTCGGCACG 867
GTCGAAAACGTGCTTGCCTCCATTGACGAGATCAAAGGCGAAAAG 912
TTGAAAGAAACGCTGCGCCAACACCGGGAGATGGCGCTGTTAAGC 957
AAAAAGCTCGCCGCCATTCGCCGCGACGCCCCGGTCGAGCTCTCG 1002
CTTGATGACATCGCCTATCAAGGGGAAGACCGGGAGAAAGTGGTC 1047
GCTTTATTTAAAGAGCTTGGGTTTCAATCGTTTTTAGAGAAAATG 1092
GAATCGCCGTCATCAGAAGAGGAAAAACCGCTTGCCAAGATGGCA 1137
TTTACGCTTGCTGACCGCGTGACGGAGGAGATGCTTGCCGACAAG 1182
GCGGCGCTTGTCGTTGAAGTGGTCGAGGAAAATTATCATGATGCG 1227
CCGATCGTCGGCATCGCTGTGGTCAACGAACATGGACGGTTTTTC 1272
CTGCGCCCGGAGACGGCGCTTGCCGATCCGCAGTTTGTCGCCTGG 1317
CTTGGTGATGAAACGAAGAAAAAAAGCATGTTTGACTCAAAGCGC 1362
GCGGCAGTCGCCTTGAAATGGAAAGGAATTGAGCTATGCGGCGTT 1407
TCCTTTGATTTATTGCTGGCCGCCTATTTGCTTGATCCGGCGCAA 1452
GGTGTTGATGATGTGGCTGCCGCAGCAAAAATGAAGCAATACGAA 1497
GCGGTGCGCCCGGATGAAGCGGTGTATGGCAAAGGGGCGAAGCGG 1542
GCCGTGCCGGATGAGCCAGTGCTCGCCGAGCATTTGGTCCGCAAG 1587
GCGGCGGCGATTTGGGCGCTCGAACGGCCGTTTTTGGATGAGCTG 1632
CGCCGCAACGAACAAGATCGGTTGCTCGTCGAGCTCGAGCAGCCG 1677
TTGTCTTCGATTTTGGCGGAAATGGAATTTGCCGGAGTGAAAGTG 1722
GATACGAAGCGGCTCGAACAGATGGGCGAAGAGCTCGCCGAGCAG 1767
CTGCGCACGGTCGAGCAGCGCATTTATGAGCTCGCCGGCCAAGAA 1812
TTCAACATCAATTCACCGAAACAGCTCGGCGTCATTTTATTTGAA 1857
AAACTGCAGCTGCCCGTCTTGAAAAAAAGCAAAACCGGCTACTCC 1902
ACTTCGGCGGATGTGCTTGAAAAACTTGCGCCTTATCACGAGATC 1947
GTGGAAAACATTTTGCAACATTACCGCCAGCTTGGCAAGTTGCAG 1992
TCGACGTATATTGAAGGATTGCTGAAAGTCGTGCGACCCGATACA 2037
AAGAAGGTGCATACGATTTTCAATCAGGCGTTGACGCAAACCGGA 2082
CGGCTCAGCTCGACGGAGCCGAACTTGCAAAACATTCCGATTCGG 2127
CTTGAGGAAGGACGGAAAATCCGCCAAGCGTTCGTGCCGTCGGAG 2172
TCTGATTGGCTCATTTTCGCTGCCGACTACTCGCAAATTGAGTTG 2217
CGCGTCCTCGCCCATATTGCGGAAGATGACAATTTAATGGAAGCG 2262
TTCCGCCGCGATTTGGATATCCATACGAAAACAGCGATGGACATT 2307
TTCCAAGTGAGCGAGGACGAAGTGACGCCCAACATGCGCCGTCAG 2352
GCGAAGGCGGTCAACTTTGGGATCGTTTACGGGATCAGTGATTAC 2397
GGCTTGGCGCAAAACTTAAATATTTCACGCAAAGAGGCCGCTGAA 2442
TTCATCGAGCGCTACTTCGAAAGCTTCCCTGGCGTGAAGCGGTAT 2487
ATGGAAAACATTGTGCAAGAAGCAAAACAGAAAGGGTATGTGACG 2532
ACGCTGCTGCATCGGCGCCGCTATTTGCCGGATATTACGAGCCGC 2577
AACTTCAACGTCCGCAGCTTTGCTGAACGGATGGCGATGAACACG 2622
CCGATTCAAGGGAGCGCCGCTGACATTATTAAAAAGGCGATGATC 2667
GATCTGAACGCCAGACTGAAGGAAGAGCGGCTGCAAGCGCGCCTT 2712
TTGCTGCAGGTGCATGACGAGCTCATTTTGGAGGCGCCGAAAGAA 2757
GAGATGGAGCGGCTGTGCCGGCTCGTTCCGGAAGTGATGGAGCAA 2802
GCGGTCACACTTCGCGTGCCGCTCAAAGTCGATTACCATTACGGC 2847
TCGACATGGTATGACGCGAAATAAAAAGGAGTCTTGGTGTGTGGATCGCCG 2898
GCACCCCTAAAAGGCCGGTGATTTAAGGGGAAATACTGCTCTCCAACAGTGTTTCTCAAA 2958
TTGAAAAACCTTGCAACACCATCACTTCATTCCTTGTGATTTCTCATAAATCAAGCGAAT 3018
CCATTGTTTTTCATCAGCCTTCTAAGAAGGCCTGTGATGGAATGAAAAAGCAGTTTCACA 3078
ACGACTCTTCTCCAGTTGAGAAGCCTTGGGACATCGAGTCGTCCTTCTCAACCAACATGA 3138
CCGATTTTGCGAAAATCAGCGTTTCTCACCGGCCTTCTAGGCAGAATCTTTCGGTGCGAC 3198
GATTCTCGGCTGCAACTCGGATGAATTGGAGCGAAATCAGCTGCCGCCCCATGG 3252
(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 amino acid residues
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL:
(iv) ANTI-SENSE:
(v) FRAGMENT TYPE: N-terminal fragment
(vi) ORIGINAL SOURCE:
(A) ORGANISM:
(B) STRAIN:
(C) INDIVIDUAL ISOLATE:
(D) DEVELOPMENTAL STAGE:
(E) HAPLOTYPE:
(F) TISSUE TYPE:
(G) CELL TYPE:
(H) CELL LINE:
(I) ORGANELLE:
(vii) IMMEDIATE SOURCE:
(A) LIBRARY:
(B) CLONE:
(viii) POSITION IN GENOME:
(A) CHROMOSOME/SEGMENT:
(B) MAP POSITION:
(C) UNITS:
(ix) FEATURE:
(A) NAME/KEY:
(B) LOCATION:
(C) IDENTIFICATION METHOD:
(D) OTHER INFORMATION:
(x) PUBLICATION INFORMATION:
(A) AUTHORS:
(B) TITLE:
(C) JOURNAL:
(D) VOLUME:
(E) ISSUE:
(F) PAGES:
(G) DATE:
(H) DOCUMENT NUMBER:
(I) FILING DATE:
(J) PUBLICATION DATE:
(K) RELEVANT RESIDUES IN SEQ ID NO:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
Met Lys Lys Lys Leu Val Leu Ile Asp Gly Ser Ser Val Ala Tyr
1 5 10 15
Arg Ala Phe Phe Ala Leu Pro
20
(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2631 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: genomic DNA
(iii) HYPOTHETICAL:
(iv) ANTI-SENSE:
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus caldotenax
(B) STRAIN: YT-G(DSM406)
(C) INDIVIDUAL ISOLATE:
(D) DEVELOPMENTAL STAGE:
(E) HAPLOTYPE:
(F) TISSUE TYPE:
(G) CELL TYPE:
(H) CELL LINE:
(I) ORGANELLE:
(vii) IMMEDIATE SOURCE:
(A) LIBRARY:
(B) CLONE:
(viii) POSITION IN GENOME:
(A) CHROMOSOME/SEGMENT:
(B) MAP POSITION:
(C) UNITS:
(ix) FEATURE:
(A) NAME/KEY:
(B) LOCATION:
(C) IDENTIFICATION METHOD:
(D) OTHER INFORMATION:
(x) PUBLICATION INFORMATION:
(A) AUTHORS:
(B) TITLE:
(C) JOURNAL:
(D) VOLUME:
(E) ISSUE:
(F) PAGES:
(G) DATE:
(H) DOCUMENT NUMBER:
(I) FILING DATE:
(J) PUBLICATION DATE:
(K) RELEVANT RESIDUES IN SEQ ID NO:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
TTGAAAAAAAAGCTTGTTTTAATCGACGGCAGCAGCGTGGCGTAC 45
CGCGCCTTTTTCGCCTTGCCGCTTTTGCATAACGACAAAGGCATC 90
CATACGAACGCCGTCTACGGGTTTACGATGATGTTGAATAAAATT 135
TTGGCGGAAGAAGAGCCAACTCATATGCTTGTCGCGTTTGACGCC 180
GGGAAAACGACGTTCCGGCATGAAGCGTTTCAAGAGTATAAAGGT 225
GGGCGCCAGCAGACGCCACCGGAGCTGTCGGAGCAGTTTCCGCTG 270
TTGCGCGAGCTGCTGAGGGCGTATCGCATCCCCGCCTATGAACTC 315
GAGAACTACGAAGCGGACGATATTATCGGAACGCTTGCCGCCCGC 360
GCTGAGCAGGAAGGGTTTGAGGTGAAAGTCATTTCCGGCGACCGC 405
GATCTGACCCAGCTCGCCTCCCCCCATGTGACGGTGGACATTACG 450
AAAAAAGGGATTACCGATATCGAACCGTACACGCCGGAGGCGGTC 495
CGCGAAAAATACGGCTTAACTCCGGAACAAATCGTTGATTTGAAA 540
GGATTGATGGGCGACAAATCGGACAACATTCCCGGAGTGCCGGGC 585
ATCGGGGAAAAGACGGCGGTCAAGCTGCTCAGGCAATTCGGCACG 630
GTCGAAAACGTGCTTGCCTCCATTGACGAGATCAAAGGCGAAAAG 675
TTGAAAGAAACGCTGCGCCAACACCGGGAGATGGCGCTGTTAAGC 720
AAAAAGCTCGCCGCCATTCGCCGCGACGCCCCGGTCGAGCTCTCG 765
CTTGATGACATCGCCTATCAAGGGGAAGACCGGGAGAAAGTGGTC 810
GCTTTATTTAAAGAGCTTGGGTTTCAATCGTTTTTAGAGAAAATG 855
GAATCGCCGTCATCAGAAGAGGAAAAACCGCTTGCCAAGATGGCA 900
TTTACGCTTGCTGACCGCGTGACGGAGGAGATGCTTGCCGACAAG 945
GCGGCGCTTGTCGTTGAAGTGGTCGAGGAAAATTATCATGATGCG 990
CCGATCGTCGGCATCGCTGTGGTCAACGAACATGGACGGTTTTTC 1035
CTGCGCCCGGAGACGGCGCTTGCCGATCCGCAGTTTGTCGCCTGG 1080
CTTGGTGATGAAACGAAGAAAAAAAGCATGTTTGACTCAAAGCGC 1125
GCGGCAGTCGCCTTGAAATGGAAAGGAATTGAGCTATGCGGCGTT 1170
TCCTTTGATTTATTGCTGGCCGCCTATTTGCTTGATCCGGCGCAA 1215
GGTGTTGATGATGTGGCTGCCGCAGCAAAAATGAAGCAATACGAA 1260
GCGGTGCGCCCGGATGAAGCGGTGTATGGCAAAGGGGCGAAGCGG 1305
GCCGTGCCGGATGAGCCAGTGCTCGCCGAGCATTTGGTCCGCAAG 1350
GCGGCGGCGATTTGGGCGCTCGAACGGCCGTTTTTGGATGAGCTG 1395
CGCCGCAACGAACAAGATCGGTTGCTCGTCGAGCTCGAGCAGCCG 1440
TTGTCTTCGATTTTGGCGGAAATGGAATTTGCCGGAGTGAAAGTG 1485
GATACGAAGCGGCTCGAACAGATGGGCGAAGAGCTCGCCGAGCAG 1530
CTGCGCACGGTCGAGCAGCGCATTTATGAGCTCGCCGGCCAAGAA 1575
TTCAACATCAATTCACCGAAACAGCTCGGCGTCATTTTATTTGAA 1620
AAACTGCAGCTGCCCGTCTTGAAAAAAAGCAAAACCGGCTACTCC 1665
ACTTCGGCGGATGTGCTTGAAAAACTTGCGCCTTATCACGAGATC 1710
GTGGAAAACATTTTGCAACATTACCGCCAGCTTGGCAAGTTGCAG 1755
TCGACGTATATTGAAGGATTGCTGAAAGTCGTGCGACCCGATACA 1800
AAGAAGGTGCATACGATTTTCAATCAGGCGTTGACGCAAACCGGA 1845
CGGCTCAGCTCGACGGAGCCGAACTTGCAAAACATTCCGATTCGG 1890
CTTGAGGAAGGACGGAAAATCCGCCAAGCGTTCGTGCCGTCGGAG 1935
TCTGATTGGCTCATTTTCGCTGCCGACTACTCGCAAATTGAGTTG 1980
CGCGTCCTCGCCCATATTGCGGAAGATGACAATTTAATGGAAGCG 2025
TTCCGCCGCGATTTGGATATCCATACGAAAACAGCGATGGACATT 2070
TTCCAAGTGAGCGAGGACGAAGTGACGCCCAACATGCGCCGTCAG 2115
GCGAAGGCGGTCAACTTTGGGATCGTTTACGGGATCAGTGATTAC 2160
GGCTTGGCGCAAAACTTAAATATTTCACGCAAAGAGGCCGCTGAA 2205
TTCATCGAGCGCTACTTCGAAAGCTTCCCTGGCGTGAAGCGGTAT 2250
ATGGAAAACATTGTGCAAGAAGCAAAACAGAAAGGGTATGTGACG 2295
ACGCTGCTGCATCGGCGCCGCTATTTGCCGGATATTACGAGCCGC 2340
AACTTCAACGTCCGCAGCTTTGCTGAACGGATGGCGATGAACACG 2385
CCGATTCAAGGGAGCGCCGCTGACATTATTAAAAAGGCGATGATC 2430
GATCTGAACGCCAGACTGAAGGAAGAGCGGCTGCAAGCGCGCCTT 2475
TTGCTGCAGGTGCATGACGAGCTCATTTTGGAGGCGCCGAAAGAA 2520
GAGATGGAGCGGCTGTGCCGGCTCGTTCCGGAAGTGATGGAGCAA 2565
GCGGTCACACTTCGCGTGCCGCTCAAAGTCGATTACCATTACGGC 2610
TCGACATGGTATGACGCGAAA 2631
(2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 877 amino acid residues
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL:
(iv) ANTI-SENSE:
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:
(A) ORGANISM:
(B) STRAIN:
(C) INDIVIDUAL ISOLATE:
(D) DEVELOPMENTAL STAGE:
(E) HAPLOTYPE:
(F) TISSUE TYPE:
(G) CELL TYPE:
(H) CELL LINE:
(I) ORGANELLE:
(vii) IMMEDIATE SOURCE:
(A) LIBRARY:
(B) CLONE:
(viii) POSITION IN GENOME:
(A) CHROMOSOME/SEGMENT:
(B) MAP POSITION:
(C) UNITS:
(ix) FEATURE:
(A) NAME/KEY:
(B) LOCATION:
(C) IDENTIFICATION METHOD:
(D) OTHER INFORMATION:
(x) PUBLICATION INFORMATION:
(A) AUTHORS:
(B) TITLE:
(C) JOURNAL:
(D) VOLUME:
(E) ISSUE:
(F) PAGES:
(G) DATE:
(H) DOCUMENT NUMBER:
(I) FILING DATE:
(J) PUBLICATION DATE:
(K) RELEVANT RESIDUES IN SEQ ID NO:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
Met Lys Lys Lys Leu Val Leu Ile Asp Gly Ser Ser Val Ala Tyr
1 5 10 15
Arg Ala Phe Phe Ala Leu Pro Leu Leu His Asn Asp Lys Gly Ile
20 25 30
His Thr Asn Ala Val Tyr Gly Phe Thr Met Met Leu Asn Lys Ile
35 40 45
Leu Ala Glu Glu Glu Pro Thr His Met Leu Val Ala Phe Asp Ala
50 55 60
Gly Lys Thr Thr Phe Arg His Glu Ala Phe Gln Glu Tyr Lys Gly
65 70 75
Gly Arg Gln Gln Thr Pro Pro Glu Leu Ser Glu Gln Phe Pro Leu
80 85 90
Leu Arg Glu Leu Leu Arg Ala Tyr Arg Ile Pro Ala Tyr Glu Leu
95 100 105
Glu Asn Tyr Glu Ala Asp Asp Ile Ile Gly Thr Leu Ala Ala Arg
110 115 120
Ala Glu Gln Glu Gly Phe Glu Val Lys Val Ile Ser Gly Asp Arg
125 130 135
Asp Leu Thr Gln Leu Ala Ser Pro His Val Thr Val Asp Ile Thr
140 145 150
Lys Lys Gly Ile Thr Asp Ile Glu Pro Tyr Thr Pro Glu Ala Val
155 160 165
Arg Glu Lys Tyr Gly Leu Thr Pro Glu Gln Ile Val Asp Leu Lys
170 175 180
Gly Leu Met Gly Asp Lys Ser Asp Asn Ile Pro Gly Val Pro Gly
185 190 195
Ile Gly Glu Lys Thr Ala Val Lys Leu Leu Arg Gln Phe Gly Thr
200 205 210
Val Glu Asn Val Leu Ala Ser Ile Asp Glu Ile Lys Gly Glu Lys
215 220 225
Leu Lys Glu Thr Leu Arg Gln His Arg Glu Met Ala Leu Leu Ser
230 235 240
Lys Lys Leu Ala Ala Ile Arg Arg Asp Ala Pro Val Glu Leu Ser
245 250 255
Leu Asp Asp Ile Ala Tyr Gln Gly Glu Asp Arg Glu Lys Val Val
260 265 270
Ala Leu Phe Lys Glu Leu Gly Phe Gln Ser Phe Leu Glu Lys Met
275 280 285
Glu Ser Pro Ser Ser Glu Glu Glu Lys Pro Leu Ala Lys Met Ala
290 295 300
Phe Thr Leu Ala Asp Arg Val Thr Glu Glu Met Leu Ala Asp Lys
305 310 315
Ala Ala Leu Val Val Glu Val Val Glu Glu Asn Tyr His Asp Ala
320 325 330
Pro Ile Val Gly Ile Ala Val Val Asn Glu His Gly Arg Phe Phe
335 340 345
Leu Arg Pro Glu Thr Ala Leu Ala Asp Pro Gln Phe Val Ala Trp
350 355 360
Leu Gly Asp Glu Thr Lys Lys Lys Ser Met Phe Asp Ser Lys Arg
365 370 375
Ala Ala Val Ala Leu Lys Trp Lys Gly Ile Glu Leu Cys Gly Val
380 385 390
Ser Phe Asp Leu Leu Leu Ala Ala Tyr Leu Leu Asp Pro Ala Gln
395 400 405
Gly Val Asp Asp Val Ala Ala Ala Ala Lys Met Lys Gln Tyr Glu
410 415 420
Ala Val Arg Pro Asp Glu Ala Val Tyr Gly Lys Gly Ala Lys Arg
425 430 435
Ala Val Pro Asp Glu Pro Val Leu Ala Glu His Leu Val Arg Lys
440 445 450
Ala Ala Ala Ile Trp Ala Leu Glu Arg Pro Phe Leu Asp Glu Leu
455 460 465
Arg Arg Asn Glu Gln Asp Arg Leu Leu Val Glu Leu Glu Gln Pro
470 475 480
Leu Ser Ser Ile Leu Ala Glu Met Glu Phe Ala Gly Val Lys Val
485 490 495
Asp Thr Lys Arg Leu Glu Gln Met Gly Glu Glu Leu Ala Glu Gln
500 505 510
Leu Arg Thr Val Glu Gln Arg Ile Tyr Glu Leu Ala Gly Gln Glu
515 520 525
Phe Asn Ile Asn Ser Pro Lys Gln Leu Gly Val Ile Leu Phe Glu
530 535 540
Lys Leu Gln Leu Pro Val Leu Lys Lys Ser Lys Thr Gly Tyr Ser
545 550 555
Thr Ser Ala Asp Val Leu Glu Lys Leu Ala Pro Tyr His Glu Ile
                                 560565570                                                                      ValGluAsnIleLeuGlnHisTyrArgGlnLeuGlyLysLeuGln                                  575580585                                                                      SerThrTyrIleGluGlyLeuLeuLysValValArgProAspThr                                  590595600                                                                      LysLysValHisThrIlePheAsnGlnAlaLeuThrGlnThrGly                                  605610615                                                                      ArgLeuSerSerThrGluProAsnLeuGlnAsnIleProIleArg                                  620625630                                                                      LeuGluGluGlyArgLysIleArgGlnAlaPheValProSerGlu                                  635640645                                                                      SerAspTrpLeuIlePheAlaAlaAspTyrSerGlnIleGluLeu                                  650655660                                                                      ArgValLeuAlaHisIleAlaGluAspAspAsnLeuMetGluAla                                  665670675                                                                      PheArgArgAspLeuAspIleHisThrLysThrAlaMetAspIle                                  680685690                                                                      PheGlnValSerGluAspGluValThrProAsnMetArgArgGln                                  695700705                                                                      AlaLysAlaValAsnPheGlyIleValTyrGlyIleSerAspTyr                                  710715720                                                                      GlyLeuAlaGlnAsnLeuAsnIleSerArgLysGluAlaAlaGlu                                  725730735                                                                      PheIleGluArgTyrPheGluSerPheProGlyValLysArgTyr                                  740745750                                                              
MetGluAsnIleValGlnGluAlaLysGlnLysGlyTyrValThr
            755            760            765
ThrLeuLeuHisArgArgArgTyrLeuProAspIleThrSerArg
            770            775            780
AsnPheAsnValArgSerPheAlaGluArgMetAlaMetAsnThr
            785            790            795
ProIleGlnGlySerAlaAlaAspIleIleLysLysAlaMetIle
            800            805            810
AspLeuAsnAlaArgLeuLysGluGluArgLeuGlnAlaArgLeu
            815            820            825
LeuLeuGlnValHisAspGluLeuIleLeuGluAlaProLysGlu
            830            835            840
GluMetGluArgLeuCysArgLeuValProGluValMetGluGln
            845            850            855
AlaValThrLeuArgValProLeuLysValAspTyrHisTyrGly
            860            865            870
SerThrTrpTyrAspAlaLys
            875

(2) INFORMATION FOR SEQ ID NO:15:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 10 amino acid residues
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: protein
  (iii) HYPOTHETICAL:
  (iv) ANTI-SENSE:
  (v) FRAGMENT TYPE: N-terminal fragment
  (vi) ORIGINAL SOURCE:
    (A) ORGANISM:
    (B) STRAIN:
    (C) INDIVIDUAL ISOLATE:
    (D) DEVELOPMENTAL STAGE:
    (E) HAPLOTYPE:
    (F) TISSUE TYPE:
    (G) CELL TYPE:
    (H) CELL LINE:
    (I) ORGANELLE:
  (vii) IMMEDIATE SOURCE:
    (A) LIBRARY:
    (B) CLONE:
  (viii) POSITION IN GENOME:
    (A) CHROMOSOME/SEGMENT:
    (B) MAP POSITION:
    (C) UNITS:
  (ix) FEATURE:
    (A) NAME/KEY:
    (B) LOCATION:
    (C) IDENTIFICATION METHOD:
    (D) OTHER INFORMATION:
  (x) PUBLICATION INFORMATION:
    (A) AUTHORS:
    (B) TITLE:
    (C) JOURNAL:
    (D) VOLUME:
    (E) ISSUE:
    (F) PAGES:
    (G) DATE:
    (H) DOCUMENT NUMBER:
    (I) FILING DATE:
    (J) PUBLICATION DATE:
    (K) RELEVANT RESIDUES IN SEQ ID NO:
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
MetGluSerProSerSerGluGluGluLys
1             5             10

(2) INFORMATION FOR SEQ ID NO:16:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 1779 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: double
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: genomic DNA
  (iii) HYPOTHETICAL:
  (iv) ANTI-SENSE:
  (v) FRAGMENT TYPE:
  (vi) ORIGINAL SOURCE:
    (A) ORGANISM: Bacillus caldotenax
    (B) STRAIN: YT-G(DSM406)
    (C) INDIVIDUAL ISOLATE:
    (D) DEVELOPMENTAL STAGE:
    (E) HAPLOTYPE:
    (F) TISSUE TYPE:
    (G) CELL TYPE:
    (H) CELL LINE:
    (I) ORGANELLE:
  (vii) IMMEDIATE SOURCE:
    (A) LIBRARY:
    (B) CLONE:
  (viii) POSITION IN GENOME:
    (A) CHROMOSOME/SEGMENT:
    (B) MAP POSITION:
    (C) UNITS:
  (ix) FEATURE:
    (A) NAME/KEY:
    (B) LOCATION:
    (C) IDENTIFICATION METHOD:
    (D) OTHER INFORMATION:
  (x) PUBLICATION INFORMATION:
    (A) AUTHORS:
    (B) TITLE:
    (C) JOURNAL:
    (D) VOLUME:
    (E) ISSUE:
    (F) PAGES:
    (G) DATE:
    (H) DOCUMENT NUMBER:
    (I) FILING DATE:
    (J) PUBLICATION DATE:
    (K) RELEVANT RESIDUES IN SEQ ID NO:
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
ATGGAATCGCCGTCATCAGAAGAGGAAAAACCGCTTGCCAAGATG  45
GCATTTACGCTTGCTGACCGCGTGACGGAGGAGATGCTTGCCGAC  90
AAGGCGGCGCTTGTCGTTGAAGTGGTCGAGGAAAATTATCATGAT  135
GCGCCGATCGTCGGCATCGCTGTGGTCAACGAACATGGACGGTTT  180
TTCCTGCGCCCGGAGACGGCGCTTGCCGATCCGCAGTTTGTCGCC  225
TGGCTTGGTGATGAAACGAAGAAAAAAAGCATGTTTGACTCAAAG  270
CGCGCGGCAGTCGCCTTGAAATGGAAAGGAATTGAGCTATGCGGC  315
GTTTCCTTTGATTTATTGCTGGCCGCCTATTTGCTTGATCCGGCG  360
CAAGGTGTTGATGATGTGGCTGCCGCAGCAAAAATGAAGCAATAC  405
GAAGCGGTGCGCCCGGATGAAGCGGTGTATGGCAAAGGGGCGAAG  450
CGGGCCGTGCCGGATGAGCCAGTGCTCGCCGAGCATTTGGTCCGC  495
AAGGCGGCGGCGATTTGGGCGCTCGAACGGCCGTTTTTGGATGAG  540
CTGCGCCGCAACGAACAAGATCGGTTGCTCGTCGAGCTCGAGCAG  585
CCGTTGTCTTCGATTTTGGCGGAAATGGAATTTGCCGGAGTGAAA  630
GTGGATACGAAGCGGCTCGAACAGATGGGCGAAGAGCTCGCCGAG  675
CAGCTGCGCACGGTCGAGCAGCGCATTTATGAGCTCGCCGGCCAA  720
GAATTCAACATCAATTCACCGAAACAGCTCGGCGTCATTTTATTT  765
GAAAAACTGCAGCTGCCCGTCTTGAAAAAAAGCAAAACCGGCTAC  810
TCCACTTCGGCGGATGTGCTTGAAAAACTTGCGCCTTATCACGAG  855
ATCGTGGAAAACATTTTGCAACATTACCGCCAGCTTGGCAAGTTG  900
CAGTCGACGTATATTGAAGGATTGCTGAAAGTCGTGCGACCCGAT  945
ACAAAGAAGGTGCATACGATTTTCAATCAGGCGTTGACGCAAACC  990
GGACGGCTCAGCTCGACGGAGCCGAACTTGCAAAACATTCCGATT  1035
CGGCTTGAGGAAGGACGGAAAATCCGCCAAGCGTTCGTGCCGTCG  1080
GAGTCTGATTGGCTCATTTTCGCTGCCGACTACTCGCAAATTGAG  1125
TTGCGCGTCCTCGCCCATATTGCGGAAGATGACAATTTAATGGAA  1170
GCGTTCCGCCGCGATTTGGATATCCATACGAAAACAGCGATGGAC  1215
ATTTTCCAAGTGAGCGAGGACGAAGTGACGCCCAACATGCGCCGT  1260
CAGGCGAAGGCGGTCAACTTTGGGATCGTTTACGGGATCAGTGAT  1305
TACGGCTTGGCGCAAAACTTAAATATTTCACGCAAAGAGGCCGCT  1350
GAATTCATCGAGCGCTACTTCGAAAGCTTCCCTGGCGTGAAGCGG  1395
TATATGGAAAACATTGTGCAAGAAGCAAAACAGAAAGGGTATGTG  1440
ACGACGCTGCTGCATCGGCGCCGCTATTTGCCGGATATTACGAGC  1485
CGCAACTTCAACGTCCGCAGCTTTGCTGAACGGATGGCGATGAAC  1530
ACGCCGATTCAAGGGAGCGCCGCTGACATTATTAAAAAGGCGATG  1575
ATCGATCTGAACGCCAGACTGAAGGAAGAGCGGCTGCAAGCGCGC  1620
CTTTTGCTGCAGGTGCATGACGAGCTCATTTTGGAGGCGCCGAAA  1665
GAAGAGATGGAGCGGCTGTGCCGGCTCGTTCCGGAAGTGATGGAG  1710
CAAGCGGTCACACTTCGCGTGCCGCTCAAAGTCGATTACCATTAC  1755
GGCTCGACATGGTATGACGCGAAA  1779

(2) INFORMATION FOR SEQ ID NO:17:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 593 amino acid residues
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
  (ii) MOLECULE TYPE: protein
  (iii) HYPOTHETICAL:
  (iv) ANTI-SENSE:
  (v) FRAGMENT TYPE:
  (vi) ORIGINAL SOURCE:
    (A) ORGANISM:
    (B) STRAIN:
    (C) INDIVIDUAL ISOLATE:
    (D) DEVELOPMENTAL STAGE:
    (E) HAPLOTYPE:
    (F) TISSUE TYPE:
    (G) CELL TYPE:
    (H) CELL LINE:
    (I) ORGANELLE:
  (vii) IMMEDIATE SOURCE:
    (A) LIBRARY:
    (B) CLONE:
  (viii) POSITION IN GENOME:
    (A) CHROMOSOME/SEGMENT:
    (B) MAP POSITION:
    (C) UNITS:
  (ix) FEATURE:
    (A) NAME/KEY:
    (B) LOCATION:
    (C) IDENTIFICATION METHOD:
    (D) OTHER INFORMATION:
  (x) PUBLICATION INFORMATION:
    (A) AUTHORS:
    (B) TITLE:
    (C) JOURNAL:
    (D) VOLUME:
    (E) ISSUE:
    (F) PAGES:
    (G) DATE:
    (H) DOCUMENT NUMBER:
    (I) FILING DATE:
    (J) PUBLICATION DATE:
    (K) RELEVANT RESIDUES IN SEQ ID NO:
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
MetGluSerProSerSerGluGluGluLysProLeuAlaLysMet
1             5             10             15
AlaPheThrLeuAlaAspArgValThrGluGluMetLeuAlaAsp
             20             25             30
LysAlaAlaLeuValValGluValValGluGluAsnTyrHisAsp
             35             40             45
AlaProIleValGlyIleAlaValValAsnGluHisGlyArgPhe
             50             55             60
PheLeuArgProGluThrAlaLeuAlaAspProGlnPheValAla
             65             70             75
TrpLeuGlyAspGluThrLysLysLysSerMetPheAspSerLys
             80             85             90
ArgAlaAlaValAlaLeuLysTrpLysGlyIleGluLeuCysGly
             95            100            105
ValSerPheAspLeuLeuLeuAlaAlaTyrLeuLeuAspProAla
            110            115            120
GlnGlyValAspAspValAlaAlaAlaAlaLysMetLysGlnTyr
            125            130            135
GluAlaValArgProAspGluAlaValTyrGlyLysGlyAlaLys
            140            145            150
ArgAlaValProAspGluProValLeuAlaGluHisLeuValArg
            155            160            165
LysAlaAlaAlaIleTrpAlaLeuGluArgProPheLeuAspGlu
            170            175            180
LeuArgArgAsnGluGlnAspArgLeuLeuValGluLeuGluGln
            185            190            195
ProLeuSerSerIleLeuAlaGluMetGluPheAlaGlyValLys
            200            205            210
ValAspThrLysArgLeuGluGlnMetGlyGluGluLeuAlaGlu
            215            220            225
GlnLeuArgThrValGluGlnArgIleTyrGluLeuAlaGlyGln
            230            235            240
GluPheAsnIleAsnSerProLysGlnLeuGlyValIleLeuPhe
            245            250            255
GluLysLeuGlnLeuProValLeuLysLysSerLysThrGlyTyr
            260            265            270
SerThrSerAlaAspValLeuGluLysLeuAlaProTyrHisGlu
            275            280            285
IleValGluAsnIleLeuGlnHisTyrArgGlnLeuGlyLysLeu
            290            295            300
GlnSerThrTyrIleGluGlyLeuLeuLysValValArgProAsp
            305            310            315
ThrLysLysValHisThrIlePheAsnGlnAlaLeuThrGlnThr
            320            325            330
GlyArgLeuSerSerThrGluProAsnLeuGlnAsnIleProIle
            335            340            345
ArgLeuGluGluGlyArgLysIleArgGlnAlaPheValProSer
            350            355            360
GluSerAspTrpLeuIlePheAlaAlaAspTyrSerGlnIleGlu
            365            370            375
LeuArgValLeuAlaHisIleAlaGluAspAspAsnLeuMetGlu
            380            385            390
AlaPheArgArgAspLeuAspIleHisThrLysThrAlaMetAsp
            395            400            405
IlePheGlnValSerGluAspGluValThrProAsnMetArgArg
            410            415            420
GlnAlaLysAlaValAsnPheGlyIleValTyrGlyIleSerAsp
            425            430            435
TyrGlyLeuAlaGlnAsnLeuAsnIleSerArgLysGluAlaAla
            440            445            450
GluPheIleGluArgTyrPheGluSerPheProGlyValLysArg
            455            460            465
TyrMetGluAsnIleValGlnGluAlaLysGlnLysGlyTyrVal
            470            475            480
ThrThrLeuLeuHisArgArgArgTyrLeuProAspIleThrSer
            485            490            495
ArgAsnPheAsnValArgSerPheAlaGluArgMetAlaMetAsn
            500            505            510
ThrProIleGlnGlySerAlaAlaAspIleIleLysLysAlaMet
            515            520            525
IleAspLeuAsnAlaArgLeuLysGluGluArgLeuGlnAlaArg
            530            535            540
LeuLeuLeuGlnValHisAspGluLeuIleLeuGluAlaProLys
            545            550            555
GluGluMetGluArgLeuCysArgLeuValProGluValMetGlu
            560            565            570
GlnAlaValThrLeuArgValProLeuLysValAspTyrHisTyr
            575            580            585
GlySerThrTrpTyrAspAlaLys
            590
__________________________________________________________________________

What we claim is:
 1. An isolated and purified DNA polymerase having 3'→5' exonuclease activity and which has an amino acid sequence represented by SEQ ID NO:14 or SEQ ID NO:17 in the sequence listing. 